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• Premise of the study: New micro satellite (simple sequence repeat [SSR]) primers were developed from Eucalyptus expressed 
sequence tags (ESTs) and optimized for genetic studies of the southwestern Australian tree E. gomphocephala, which is se- 
verely impacted by tree health decline and habitat fragmentation. 

• Methods and Results: A total of 133 gene-homologous EST-SSR primer pairs were designed for Eucalyptus, and 44 were 
screened in E. gomphocephala. Of these, 17 produced reliable amplification products and 11 were polymorphic. Between two 
and 13 alleles were observed per locus, and observed heterozygosities ranged from 0.172 to 0.867. All 17 EST-SSRs that am- 
plified E. gomphocephala cross- amplified to at least one of E. marginata, E. camaldulensis, and E. victrix. 

• Conclusions: This set of EST-SSR primer pairs will be valuable tools for future population genetic studies of E. gomphocephala 
and other eucalypts, particularly for studying gene-linked variation and informing seed-sourcing strategies for ecological 
restoration. 

Key words: ecologically important genetic variation; EST-micro satellite; Eucalyptus gomphocephala; Myrtaceae; SSR min- 
ing; tuart. 



The Australasian tree genus Eucalyptus L'Her. (Myrtaceae) 
comprises more than 700 species (Brooker, 2000) and is eco- 
nomically significant to the forestry industry. Vast genomic re- 
sources for model species have allowed molecular markers to 
be developed in related nonmodel taxa of conservation signifi- 
cance. Eucalyptus gomphocephala DC. (subg. Symphyomyrtus, 
common name tuart) is endemic to the Swan Coastal Plain of 
southwestern Australia, and is a culturally iconic species in the 
region. Severe pathogen-mediated tree decline (Cai et al., 2010), 
together with extensive habitat fragmentation in urbanized 
areas, has required rapid conservation actions for the species. 
To inform seed sourcing strategies for ecological restoration 
and forest management in the context of climate change, mea- 
sures of adaptive population genetic diversity and structure are 
critical, but no microsatellite markers (simple sequence repeats 
[SSRs]) have yet been tested or optimized for E. gomphocephala. 
Specifically, SSRs within gene coding regions such as ex- 
pressed sequence tags (ESTs) will be valuable for investigat- 
ing variation in functional and potentially adaptive regions of 
the genome. 
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Here, we design novel PCR primer pairs for SSRs within 
Eucalyptus ESTs, targeting those that are homologous to an- 
notated genes and therefore those with putative ecological rel- 
evance. For the first time, we optimize a subset for utility in 
population genetic analysis of E. gomphocephala to inform 
future conservation programs. To demonstrate the markers' 
broader utility in the genus, we report cross-species amplifi- 
cation in three additional ecologically important species (E. 
camaldulensis Dehnh., E. marginata Sm., E. victrix L. A. S. 
Johnson & K. D. Hill). This is the first set of gene-homologous 
EST-SSRs to our knowledge that have been reported and tested 
in natural populations of nonmodel eucalypt species of ecologi- 
cal and conservation significance. These EST-SSR primers add 
value to a recently expanding set of EST-SSRs being made avail- 
able for Eucalyptus, increasing the variety of genes that can be 
investigated in novel population and conservation genetic stud- 
ies of nonmodel species. 



METHODS AND RESULTS 

A total of 36,001 Eucalyptus EST sequences were downloaded from 
GenBank in 2007 and screened for microsatellite repeats with Tandem Repeats 
Finder (Benson, 1999) using default parameters, and organized using Tandem 
Repeats Database version 2.30 (Gelfand et al, 2007). A total of 1098 repeats were 
detected in 1073 (3%) of 36,001 ESTs. Sequences were clustered using the 
sequence assembly program CAP3 (Huang and Madan, 1999) with default param- 
eters, into a set of 128 nonredundant contigs and 401 singletons (Appendix SI 
and S2). A total of 154 of 529 (29%) ESTs were homologous to various anno- 
tated genes, following BLASTN searches of the National Center for Biotech- 
nology Information's nucleotide (nr/nt) collection, using an £- value threshold 
of 10~ 10 . PCR primers were designed for all 154 sequences using Primer3 
(Rozen and Skaletsky, 2000). Annotated gene homologs were specifically targeted 
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so that loci could inform hypotheses relating to adaptive genetic variation in 
future studies of E. gomphocephala. All primer pairs and putative gene func- 
tions are provided in Appendix S3. 

We prioritized a random subset of 44 EST-SSRs for further PCR optimiza- 
tion in E. gomphocephala. Primer pairs were synthesized by Gene Works Pty 
Ltd (Hindmarsh, Australia) and initially tested for amplification products on a 
screening panel of seven E. gomphocephala individuals. DNA was extracted 
from freeze-dried leaf material using a NucleoSpin 8/96 Plant II Core Kit, with 
buffer set PL2/3 (Macherey-Nagel GmbH & Co., Diiren, Germany), as per 
the manufacturer's instructions. PCRs were carried out in a total volume of 
10 jiL, containing 10 ng genomic DNA template, lx PCR Polymerization Buf- 
fer containing dNTPs, 0.2 uM of each primer, 0.44 units Taq DNA polymerase, 
and 2 mM of MgCl 2 (reagents from Fisher Biotech, Perth, Western Australia, 
Australia). Thermocycling was carried out as follows: denaturation at 94°C 
for 5 min; followed by 30 cycles of 94°C for 1 min, annealing at 55°C for 30 s, 
and extension at 72°C for 30 s; followed by a final extension at 72°C for 15 min. 
Annealing temperatures (T a ) were optimized for each locus and products were 
visualized on 2% agarose gels stained with SYBR Safe (Invitrogen, Carlsbad, 
California, USA). 

A set of 17 (39%) primer pairs reliably amplified a product of expected size 
in E. gomphocephala. These loci were screened for allele size polymorphism 
and genotyped in multiplex PCR of between two and four loci per reaction, 
using a commercial kit (PCR Multiplex Kit using Q Solution; QIAGEN, Hilden, 
Germany). Forward primers were 5' end-labeled with WellRED fluorescent 
dyes (D2, D3, D4; Sigma-Aldrich, St. Louis, Missouri, USA). Reactions were 
carried out in a total volume of 12.5 jiL, containing 5-30 ng DNA template, lx 
QIAGEN Multiplex PCR Master Mix, 0.5x Q-solution, and 0.2 uM of each 
primer (with some exceptions, Table 1). PCR was conducted with an initial ac- 
tivation step at 95°C for 15 min; followed by 30 cycles of 94°C for 30 s, anneal- 
ing at T a (Table 1) for 90 s, and extension at 72°C for 60 s; followed by final 
extension at 72°C for 10 min. Products were genotyped using a CEQ 8800 
Genetic Analysis System and analyzed using CEQ Fragment Analysis Software 
(Beckman Coulter, Brea, California, USA). Two loci (EGM09 and EGM24) 
exhibited amplification failures or dubious peaks in multiplex and were ana- 
lyzed in singleplex for further reactions. Following genotyping tests, 11 EST- 
SSRs were polymorphic and produced reliable electrophoretic profiles in 
E. gomphocephala (Tables 1 and 2). For these loci, we further investigated the 
most likely gene annotations by conducting a BLASTX search against proteins 
in the UniRef 100 database (European Molecular Biology Laboratory-European 
Bioinformatics Institute [EMBL-EBI], http://www.ebi.ac.uk/Tools/blastall/) 
(Table 1). 

To screen genetic diversity of the 11 EST-SSRs, we genotyped 60 E. 
gomphocephala individuals from two natural populations: 'Yalgorup' (voucher: 
R. Davis 1390, PERTH 04473167) and 'Ludlow' (voucher: C. A. Gardner, PERTH 
01350501) (Table 2). Genetic diversity parameters were calculated with 
GenAlEx version 6.4.1 (Peakall and Smouse, 2006). Deviation from Hardy- 
Weinberg equilibrium (HWE) and linkage disequilibrium among loci were 



Table 2. Population genetic properties of the 1 1 newly developed EST- 
SSRs in two natural populations of Eucalyptus gomphocephala. a 



Locus 


Yalgorup National Park 
(n = 30) 




Ludlow Tuart Forest 
(n = 30) 






H e 






H e 


EGM09 


4 


0.400 


0.639 


4 


0.233 


0.558 


EGM12 


4 


0.600 


0.629 


4 


0.621 


0.640 


EGM14 


7 


0.733 


0.779 


8 


0.733 


0.776 


EGM24 


3 


0.345 


0.580 


3 


0.172 


0.506 


EGM25 


2 


0.233 


0.255 


2 


0.207 


0.238 


EGM30 


9 


0.667 


0.728 


12 


0.833 


0.839 


EGM35 


9 


0.667 


0.832 


6 


0.767 


0.748 


EGM37 


2 


0.433 


0.495 


2 


0.300 


0.406 


EGM46 


12 


0.867 


0.816 


9 


0.862 


0.861 


EGM47 


3 


0.533 


0.515 


3 


0.500 


0.452 


EGM48 


6 


0.833 


0.776 


5 


0.862 


0.794 



Note: A T = total number of alleles observed; H e = expected heterozygosity; 
H Q = observed heterozygosity; n = number of individuals sampled. 

a Geographic coordinates for each population are: Yalgorup National 
Park = 32°51 , 17 ,, S, 115°39 , 54 ,, E; Ludlow Tuart Forest = 33°34 , 42 ,, S, 
115°29'42"E. 



calculated with GENEPOP version 4.0.10 (Raymond and Rousset, 1995; 
Rousset, 2008), using the Bonferroni correction for multiple testing. The total 
number of alleles per locus ranged from two to 13 (mean = 6) (Table 1). Ob- 
served and expected heterozygosities ranged from 0.172 to 0.867 and 0.238 to 
0.861, respectively (Table 2). Significant deviation from HWE was detected for 
EGM09 at Ludlow. We did not detect evidence of linkage disequilibrium be- 
tween any pair of loci in more than one population. The potential presence of 
null alleles and their frequency (r) in each population were estimated using 
MICRO-CHECKER version 2.2.3 (van Oosterhout et al, 2004), and were pre- 
dicted for EGM09 and EGM24 in both populations (r range = 0.17-0.29) and 
for EGM35 at Yalgorup (r = 0.09). The potential for stuttering was predicted for 
EGM09 at Ludlow. 

All 17 primer pairs that initially amplified E. gomphocephala DNA were 
screened for cross-amplification in three additional species, belonging to di- 
verse sections within two subgenera: E. camaldulensis (sect. Exsertaria) and 
E. victrix (sect. Adnataria) (both subg. Symphyomyrtus), and E. marginata 
(sect. Longistylus) (subg. Eucalyptus) according to the methods described 
above. Samples were collected from natural populations (Appendix 1). Sixteen 
out of 17 (94%) loci cross-amplified within subg. Symphyomyrtus, and 14 out 
of 17 (82%) cross-amplified across the subgenus boundary to subg. Eucalyptus, 
demonstrating high transferability rates within the genus. 



CONCLUSIONS 

This new set of 1 1 polymorphic EST-SSR loci will enable the 
characterization of population genetic diversity and structure 
throughout the species' range of E. gomphocephala in conjunc- 
tion with putatively neutral, nuclear genomic SSRs (gSSRs). 
Given their gene-linked nature, the EST-SSRs can be used to 
test for signatures of selection in natural populations. The high 
transferability of the loci demonstrates their suitability for ap- 
plication to other species of ecological, conservation, or eco- 
nomic importance in the genus. The broader set of 1 33 EST-SSRs 
represents a resource that could be exploited to optimize addi- 
tional gene-linked EST-SSRs for Eucalyptus and further expand 
the types of genes available for investigation. Data obtained 
using these markers will be particularly valuable to inform seed 
sourcing and conservation strategies in natural E. gomphocephala 
populations. 
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Appendix 1. Cross-species amplification of 17 EST-SSR loci from Eucalyptus gomphocephala to three additional eucalypt species from two subgenera. a ' b>c 



Locus Annotation d Source e ' f E. gomphocephala E. camaldulensis E. victrix E. marginata 



EGM09 


Glucan endo-l,3-beta-glucosidase 


E. globulus 


+ 


+ 


+ 


+ 


EGM12 


CONSTANS-like protein COl 


E. tereticornis 


+ 


+ 


+ 


+ 


EGM14 


Heat shock transcription factor 21 


E. grandis 


+ 


+ 


+ 


+ 


EGM24 


Putative S-adenosylmethionine decarboxylase (SAMDC) 


E. globulus 


+ 


+ 


+ 


+ 


EGM25 


Metal transporter Nramp3 


E. globulus 


+ 


NA 


+ 


NA 


EGM30 


Chlorophyll A/B binding protein, putative 


E. gunnii 


+ 


+ 


+ 


+ 


EGM35 


Fructose- 1 ,6-bisphosphatase, cytosolic 


E. globulus 


+ 


+ 


+ 


+ 


EGM37 


Quinone oxidoreductase 


E. globulus 


+ 


+ 


+ 


+ 


EGM46 


Desiccation protectant protein Lea 14 homolog 


E. gunnii 


+ 


+ 


+ 


+ 


EGM47 


Small GTPase Rab2 


E. grandis + E. gunnii 


+ 


+ 


+ 


+ 


EGM48 


Aquaporin PIP2;1 


E. gunnii 


+ 


+ 


+ 


+ 


EGM28 


Chloroplastic phosphoglycerate kinase 


E. globulus + E. grandis 


+ 


+ 


+ 


NA 


EGM19 


Aquaporin (PIP1) 


E. globulus + E. gunnii 


+ 


+ 


+ 


+ 


EGM33 


Synaptobrevin-related protein 1 (SARI) 


E. tereticornis 


+ 


+ 


+ 


+ 


EGM34 


Rapid alkalinization factor (RALF) 


E. globulus 


+ 


+ 


+ 


+ 


EGM26 


Magnesium transporter CorA-like family protein MRS2-2 


E. globulus 


+ 


+ 


NA 


NA 


EGM42 


bZIP (basic leucine zipper) transcription factor 


E. globulus 


+ 


+ 


+ 


+ 



Note: + = positive amplification; NA = no amplification or inconsistent amplification. 

a Eucalyptus gomphocephala is taxonomically classified within the monotypic sect. Bolites (Brooker, 2000) but has been shown to have affinities with 
sect. Bisectae I according to ITS (Steane et al., 2007) and Diversity Arrays Technology (DArT) analyses (Steane et al., 2011). 

b Taxonomic information for tested species: E. gomphocephala DC. = sect. Bolites/Bisectae /, subg. Symphyomyrtus; E. camaldulensis Dehnh. = sect. 
Exsertaria, subg. Symphyomyrtus; E. victrix L. A. S. Johnson & K. D. Hill = sect. Adnataria, subg. Symphyomyrtus; E. marginata Sm. = sect. Longistylus, 
subg. Eucalyptus. 

c Source locations: E. camaldulensis = 23°06'S, 1 19°34 r E (n = 2) and 23°10 , S, 1 19 0 45 r E in = 1); E. victrix = 22°55 , S, 1 19 o 10 r E (n = 3); E. marginata = 
31 0 56 r S, lUH^E (n = 1), 32°46 r S, 116°26'E (n = 1), and 32 0 42 r S, 116°03'E (n = 1). 
d Predicted gene annotation based on BLAST searches. 
e Species from which the original EST was derived. 

f Taxonomic information for source species: E. globulus Labill. = sect. Maidenaria, subg. Symphyomyrtus; E. gunnii Hook. f. = sect. Maidenaria, subg. 
Symphyomyrtus; E. tereticornis Sm. = sect. Exsertaria, subg. Symphyomyrtus; E. grandis W. Hill = sect. Latoangulatae, subg. Symphyomyrtus. 
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